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ABSTRACT 

In the recent years Wikis have become an attractive platform 
for social studies of the human behaviour. Containing mil- 
lions records of edits across the globe, collaborative systems 
such as Wikipedia have allowed researchers to gain a bet- 
ter understanding of editors participation and their activity 
patterns. However, contributions made to Geo-wikis — wiki- 
based collaborative mapping projects — differ from systems 
such as Wikipedia in a fundamental way due to spatial di- 
mension of the content that limits the contributors to a set 
of those who posses local knowledge about a specific area 
and therefore cross-platform studies and comparisons are 
required to build a comprehensive image of online open col- 
laboration phenomena. In this work, we study the temporal 
behavioural pattern of OpenStreetMap editors, a successful 
example of geo-wiki, for two European capital cities. We 
categorise different type of temporal patterns and report on 
the historical trend within a period of 7 years of the project 
age. We also draw a comparison with the previously ob- 
served editing activity patterns of Wikipedia. 

Categories and Subject Descriptors 

H.2.8 [Database Management]: Database Applications 
- Spatial Databases and GIS; H.5.3 [Group and Orga- 
nization Interfaces] : Collaborative computing, computer- 
supported cooperative work 

General Terms 
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1. INTRODUCTION 

In the recent years, the research on wikis and open, online 
collaboration systems has attracted a great deal of attention 



from social scientist, aiming to analyse behaviour of a large 
number of individuals and the interaction between them in 
a collaborative environment in details. The outcome of such 
research not only could provide a deeper understanding on 
human collaboration and its intrinsic features in general, but 
also could have great applications in innovating and improv- 
ing new and existing collaboration platforms. A common 
finding has shown that in many of collaborative systems such 
as Wikipedia, the distribution of editors and contributions 
is highly skewed, with the majority of contributions coming 
from only about 2.5% of contributors 10 . Wikipedia as a 
predominant example of mass collaboration has been stud- 
ied vastly and from multiple aspects, e.g., size and growth 
rate [17[ [2], confiict and editorial wars |16| [22], user rep- 



utation and article quality 
readability [20|. 
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linguistic features and 



However, despite the great popularity that geo-wiki systems 
received in the last years, they have seen far less research at- 
tention, with the exception of recent studies on accuracy and 
coverage of OpenStreetManj where the accuracy has been 
shown to be high pi M and coverage to be non-uniformly 
distributed 23, 8]. However, users behaviour in this domain 
is by far less investigated subject and thus requires more re- 
search effort to be well understood. In this regard, Hristova 
et al [6] has taken the first steps to quantitatively understand 
users participation in geo-wikis. The authors show that in 
OpenStreetMap the distribution of editors and contributions 
are as skewed as Wikipedia, with 95% of contributions made 
in the area of London, UK, attributed to less than 10% of 
users. Also Panciera et al '11, found that in CyclopatnJ 5% 
of its users are responsible for the majority of its content. 
These results are even more interesting when one consid- 
ers that geo-wikis systems are fundamentally different from 
classical web-based wikis as their spatial dimension limits 
the contributors to a set of those who posses local knowl- 
edge about a specific area (i.e., have physically visited the 
place). For this reason, not all the properties investigated 
on classical web-based wikis could be assumed to hold for 
geo-wikis too. 

Furthermore, the recent popularity and widespread adop- 
tion of smart-phones has brought forward classical wiki sys- 
tems as a candidate worth considering in the urban domain 
too, with citizens becoming cartographers, with geo-wikis 
such as OpenStreetMap. Indeed, these services offer mobile 
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applications that users can deploy directly on their smart- 
phones to generate geo-tagged content on the go. Given 
these characteristics, the question we then ask ourselves is: 
what are the temporal activity patterns of geo-wiki editors 
and how they differ from those of previously highlighted in 
Wikipedia? We seek answers to this question to not only 
discover interesting patterns in human dynamics, but also 
to learn the extent to which the rise in use of mobile de- 
vices has changed the work-flow of collaborative mapping 
projects. The effects and potentials of transformation to- 
wards "mobile crowdsourcing" could be well examined by 
studying characteristics of such platforms in details, with 
great implications in other areas related to ubiquitous and 
social computing. In this direction, this paper takes the first 
steps in identifying the temporal behavioural pattern of geo- 
wiki editors. In particular, it focuses on OpenStreetMap 
(OSM) , a very successful example of geo-wikis which boasts 
over one million users collectively building a free, openly 
accessible, editable map of the world. 

Temporal patterns of human behaviour has been studied 
extensively in various fields. In the domain of web and net- 
working, the circadian patterns of the Internet traffic have 
brought interesting information about individual habits of 
the Internet usage in different societies, assisting with util- 
isation of infrastructure |15[ 111 [14]. In domain of social 
networking and collaborative systems, various works have 
addressed the burstiness nature of user's behaviour both in 
terms of voluntarily contributions [22] and edge creations be- 
tween social network users 4]. Finally, Eagle et al. j3] have 
studied the temporal behavioural pattern of human by mon- 
itoring communication, location and interactions of 100 test 
subjects. They extracted common "eigenbehaviour" struc- 
tures from their data by using principle component analysis. 

In this paper we adopt similar methodology and analyse 
the activity logs of OSM editors for two European capital 
cities of London and Rome for a period of 7 and 5 years 
respectively. We follow the principal component analysis 
to identify the repeating temporal structures of editors in 
our data, where typical structures are represented by eigen- 
patterns. Finally, we study these behavioural structures over 
different stages of OSM. Our results show that despite the 
differences between the nature of contributions of geo-wikis 
and wikis, the OSM editors generally exhibit similar tem- 
poral behavioural pattern of contributions during the day 
as Wikipedians. Furthermore, we show that over the past 
years the editors behaviour has shifted towards afternoon (3 
p.m. to 6 p.m.) activities from the late night activities per- 
haps corresponding to the success of OSM as a smart-phone 
application. However, this latter has a slower dynamics in 
the case of Rome compared to London. 

2. MATERIALS AND METHODS 

Data. The OSM dataset is freely available to download and 
contains the history of all edits (since 2005) on all spatial 
objects performed by all users. Spatial objects can be one of 
three types: nodes, ways, and relations. Nodes broadly refer 
to POIs, ways are representative of roads, and relations are 
used for grouping other objects together. For the purpose of 
this study, we selected two cities to analyse: London, UK, as 
an example of a very well represented large metropolitan city 



in OSM, and Rome, Italy, as an example of a large city which 
is steadily increasing its spatial representation in OSM. Fur- 
thermore, we focused only on nodes; in fact, we expect that 
editing pattern of nodes differ from those found in classical 
wiki systems due to simplicity of adding/editing a node on 
the fly in a more ubiquitous fashion. Finally, to consider only 
genuine users' contributions, we have looked into contribu- 
tions that most likely correspond to bulk imports. Many 
bulk imports were detected in the whole dataset, with tens 
of thousands of edits done in a single day by a single user, 
spread throughout Greater London (e.g., more than 20,000 
post boxes spread across all Greater London appeared in 
OSM in only one day in 2009 from the same user). We 
chose to discard such data for the reason that we intend to 
model genuine 'bottom-up', user-generated contributions, of 
which massive imports are not representative. The datasets 
we are left with are summarised below: 



City 



London 
Rome 



# Users #Edits Time Window 



607 534361 

563 146000 



7 years 
5 years 



Method. To capture the editing patterns of editors in OSM, 
we compute two different measures: 



• Circadian cycles. We analyzed the periodic character- 
istics of temporal patterns of editors for each hour of 
24 hours day by considering the ratio of the average 
number of edits that has been done within the consid- 
ered time window on the average number of edits that 
has been done within all the 24 hours of the day, as 
specified below: 



circadiani 



avg{^Edits^t) 
avg{#Edits) 



where, avg{^EditSAt) is average amount of edits oc- 
curred during the specific time interval At of the day 
and avg{#Edits) is the average number of edits oc- 
curred during the whole day. 

• Eigenbehaviours. We used the principle components 
analysis (PCA) of our data focusing on time of the 
day. The principle component analysis has been ex- 
tensively used in the past to identify the behavioural 
patterns of users mobility [sj [l2] . Similarly, we adopt 
the same methodology in which we define the principal 
components as a set of vectors that span a 'behaviour 
space' and characterise the behavioural variation be- 
tween users. More precisely, we create a matrix M 
where each row corresponds to a unique user in our 
dataset and each column corresponds to the time unit 
under study. Each entry mij of our matrix Ai con- 
tains the average ratio of edits that the user Ui has 
made during the time units tj. Exactly as before, as 
time unit we considered one hour time interval. For 
each column of this matrix, we compute the empiri- 
cal mean and the deviation of each column from this 
mean, after this we computed the covariance matrix B 
of the previous matrix. Finally, we extract the repeat- 
ing behaviour from this data structure by calculating 
the eigenvectors and eigenvalues of B. These extracted 



eigenbehaviours are the heavily weighted vectors that 
represent a type of repeated behaviour, such as edit- 
ing during evening, night or afternoon. In order to 
get a robust set of eigenvectors, we needed to fiher 
out editors with few edits leading to abnormal circa- 
dian patterns with picks of larger than 20% of all edits 
within a one hour window. 

3. RESULTS 

We first start by analysing the periodic characteristics of 
temporal patterns of editors inferring the circadian cycles 
of OSM edits. We considered every single edit performed 
since 2005/2007 on London/Rome and used the timestamps 
assigned to each edit to calculated the overall activity of 
users for the time of day. Figure [l] illustrates the circa- 
dian pattern. It is interesting to observe that the editing 
activities are to their minimum at early morning around 4 
a.m., followed by a rapid increase up to noon. The activity 
shows a slight increase until around 9 p.m., where it start 
to decrease during night. This result corresponds closely to 
the circadian patterns previously observed in Wikipedia [2l] 
as well as qualitatively corresponding to the other kind of 
human activities such as mobile calling behaviour [7], and 
the Internet instant messaging [l3]. However, Rome editors 
show more activity around noon and less in the afternoon 
compared to London editors. 

0.08 



the eigenvectors extracted from London data only. 

V, 
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Figure 1: The circadian pattern of edits based on 
the aggregated data for the whole dataset period. 

We further investigate the circadian activity patterns by per- 
forming PC A. The first 3 principal components Vi,V2,V3 
extracted from the individual activity pattern of London 
editors are shown in Figure |2] The sum of the 3 correspond- 
ing eigenvalues exceeds 80% of the sum of the all spectrum, 
justifying the reduction of the space dimension from 24 to 
3. It is possible to see that these 3 component roughly cor- 
respond to 3 type of editorial pattern: (i) Vi corresponds 
to afternoon edits where the peak is from 2 p.m to 6 p.m, 
(a) V2 corresponds to morning and noon edits and finally 
where the peak of edits falls from 7 a.m to 2 p.m. (lii) V3 
corresponds to night edits where the peak of edits fall from 
6 p.m. to midnight. Note that the eigenvectors indicate the 
direction of the most typical deviations from the average be- 
haviour. We repeated the same analysis for the Rome data 
which resulted in a very similar pattern. However as the 
smaller size of the Rome dataset contributes to the level of 
noise dramatically, we proceed with our analysis based on 
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Figure 2: Three Top Eigenbehaviours constructed 
from individual activity patterns of London OSM 
editors. 

In the next step we build up activity vectors for all edits 
within each year for both London and Rome and project 
them on the identified eigen-behaviours Vi , V2 , V3 . The pro- 
jected values ci, C2, C3 show to which extent the activity vec- 
tor could be reproduced by the corresponding eigen-behaviour. 
Ci's calculated for London and Rome for different years are 
shown in Figure |3] In both cases a shift towards afternoon 
edit is evident; however faster in the case of London. Fur- 
thermore, we observe a subside effect in the edits associated 
to night time activities. We speculate two reasons for these 
trends. Firstly the widespread of Internet services on de- 
vices such as smart-phones in the recent years, may have 
had an impact on the temporal behaviour of users as they 
become less limited to stationary computers (e.g., at work 
or home). It is interesting to note that the lesser extend of 
afternoon projection for Rome can similarly be explained by 
a recent studjPlwhich has highlighted a much lower adoption 
rate of smart-phones in Italy (about 11%) in comparison to 
the UK (70%) market. Secondly, as OSM system ages, the 
saturation effect on maps may mean that less mass contribu- 
tions from power-users is required, thus a decrease in night 
activities (i.e., a trend usually associated with power-users). 

4. CONCLUSION 

In this work we studied the temporal behavioural pattern of 
OpenStreetMap editors as a representative example of geo- 
wiki systems. By performing Principal Component Analysis, 
we extracted the typical editorial behaviour temporal pat- 
terns. We showed that although OSM editors generally ex- 
hibit similar temporal behavioural pattern of contributions 
during the day as Wikipedians, but over the past years the 
editors behaviour has shifted towards afternoon activities 
from the late night activities perhaps corresponding to the 
success of OSM as a smart-phone application. However, this 
trend has a slower dynamics in the case of Rome compared to 

^http://tinyurl.com/ylj9bvb 
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Figure 3: The trend of projections into the identified 
eigenbehaviours (shown in Figure [2]) over years for 
London and Rome editors. 



London. We believe these results not only shed light on the 
underlying mechanisms of collaborative mapping projects, 
but also could suggest elaborative strategies in order to in- 
crease the viability and quality of the project; facilitating 
more mobile contributions being one example. Note that 
higher quality of London map compared to Rome has been 
already reported |9., in accordance with our observation of 
larger use of mobile devices by its users. 

As future direction, we plan to go beyond this study by con- 
sidering in our temporal analysis not only the daily pattern 
of user edits, but also the difference on daily pattern dur- 
ing working days and weekends for different user categories 
(e.g., power-users, early adopters). Furthermore, to gain 
a deeper understanding of geo-wikis editors behaviour, we 
plan to divide our data into roads versus POIs, hypothesis- 
ing that editing POIs requires less skills and attention than 
roads, for this reason they may exhibit a different temporal 
pattern. 
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